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Objective: Spinal muscular atrophy (SMA) is one of the most common severe hereditary diseases of infancy and early 
childhood in North America, Europe, and Asia. SMA is usually caused by deletions of the survival motor neuron 1 
{SMN1) gene. A closely related gene, SMN2, modifies the disease severity. SMA carriers have only 1 copy of SMN1 and 
are relatively common (1 in 30-50) in populations of European and Asian descent. SMN copy numbers and SMA carrier 
frequencies have not been reliably estimated in Malians and other sub-Saharan Africans. 

Methods: We used a quantitative polymerase chain reaction assay to determine SMN1 and SMN2 copy numbers in 
628 Malians, 120 Nigerians, and 120 Kenyans. We also explored possible mechanisms for SMN1 and SMN2 copy 
number differences in Malians, and investigated their effects on SMN mRNA and protein levels. 

Results: The SMA carrier frequency in Malians is 1 in 209, lower than in Eurasians. Malians and other sub-Saharan 
Africans are more likely to have >3 copies of SMN1 than Eurasians, and more likely to lack SMN2 than Europeans. 
There was no evidence of gene conversion, gene locus duplication, or natural selection from malaria resistance to 
account for the higher SMN1 copy numbers in Malians. High SMN1 copy numbers were not associated with 
increased SMN mRNA or protein levels in human cell lines. 

Interpretation: SMA carrier frequencies are much lower in sub-Saharan Africans than in Eurasians. This finding is 
important to consider in SMA genetic counseling in individuals with black African ancestry. 
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Spinal muscular atrophy (SMA) is caused by deletions 
and other mutations in the survival motor neuron 1 
(SMN1) gene at chromosome 5ql 3. 1 3 These genetic 
lesions result in loss of a-motor neurons, leading to mus- 
cle weakness and atrophy. SMN2, which is highly homol- 
ogous to SMN1, modifies the severity of disease. 4 ' 5 
About 95 to 98% of SMA patients are homozygous for 
deletion of SMN1, and the remaining 2 to 5% are com- 
pound heterozygotes for deletion and other mutation in 
SMN1. SMA is a leading genetic cause of infant mortal- 
ity, with an estimated incidence of 1 in 8,000 to 10,000 
live births and a carrier frequency of 1 in 30—50 in pop- 
ulations of European ancestry. 6 Recent data suggest that 
the SMA carrier frequency is lower in persons of black 
African ancestry in Cuba, 7 South Africa, 8 and the United 
States. 9 

Despite the abundance of other autosomal recessive 
neuromuscular diseases due to relatively high consanguin- 
ity in Mali, 10 only 1 case of SMA has been clinically 
diagnosed in the country (unpublished data). Given the 
low incidence of SMA in Mali, we hypothesized that the 
SMA carrier frequency in this country is low. Here we 
show that in Malian and other sub-Saharan African pop- 
ulations, SMN1 copy numbers are higher than in Euro- 
peans and Asians. We also explore possible explanations 
for SMN copy number variation in Mali, and investigate 
relationships between SMN copy number and SMN 
mRNA and protein levels. 

Subjects and Methods 

Sampling of Healthy Controls and DNA 
Extraction 

Protocols were approved by the neurosciences institutional 
review board (IRB) at the National Institutes of Health (NIH), 
and the Ethical Committee at the Faculty of Medicine and 
Odontostomatology (FMOS), University of Bamako. All partic- 
ipants provided written informed consent. 

Healthy adult FMOS students of Malian descent and 
nationality were eligible for the study. In a pilot study we col- 
lected blood samples from 40 donors at the NIH Blood Bank 
and 30 students at the FMOS. In addition, we used 15 samples 
collected from healthy controls for a previous NIH study. 11 We 
subsequently recruited another 671 students at the FMOS. 
Genomic DNA was extracted from whole blood at the NIH 
and from buffy coats at the FMOS using the Centra Puregene 
blood kit (Qiagen, Gaithersburg, MD). DNA samples were 
shipped to the NIH, and aliquots were shipped to Integrated 
Genetics (Westborough, MA) for SMN1 copy number 
determination. 

We also obtained DNA samples from healthy controls of 
Luhya (Kenya, n = 120) and Yoruba (Nigeria, n = 120) eth- 
nicities from the Coriell Institute (Camden, NJ). 



Quantification of SMN Copy Number 

SMN1 and SMN2 copy numbers were quantified in 628 and 
613 of the 671 Malian samples, respectively. SMN1 copy num- 
ber was quantified in 542 of the samples at Integrated Genetics 
and an additional 86 samples at the NIH. SMN2 copy num- 
bers were quantified in all 613 samples at the NIH. The copy 
number determinations were done by quantitative real-time 
polymerase chain reaction (qPCR) technique based on Taqman 
technology (Life Technologies, Carlsbad, CA; Roche Molecular 
Systems, Pleasanton, CA). 9 ' 12 The same methods were used at 
the NIH to quantify SMN1 and SMN2 copy numbers in the 
Nigerian and Kenyan samples. Primers and methods for the 
SMN copy number estimation were previously published. 11,12 

Identification of SMN Hybrid Genes 

We amplified SMN from intron 6 to exon 8 by PCR from 
genomic DNA using previously reported primers and condi- 
tions 13 (12 NIH samples and 20 Malian samples with >3 
SMN1 copies). We then used the TA cloning kit (Life Technol- 
ogies) to subclone 15 to 20 colonies per sample. PCR products 
from each clone were subsequently digested by Ddel and 
EcoRV. Tris-borate-EDTA gel electrophoresis was done as 
published. 13 

PCR products for each clone were also sequenced at the 
National Institute of Neurologic Disorders and Stroke DNA 
sequencing facility. Sequences were analyzed based on the 5 
known nucleotide differences between SMN1 and SMN2 intron 
6 to exon 8. 14 SMN hybrids were identified by the association 
of exon 7 from SMN1 and exon 8 from SMN2, or vice versa. 
To validate results, we amplified and sequenced SMN intron 6 
to exon 8 in 18 Nigerian samples (11 with 3—4 copies of 
SMN1 and 7 with 2 copies of SMN1) and 10 NIH and 10 
Malian samples known to lack SMN2 (16 colonies per sample). 

SMN1 Copy Number and Malaria Susceptibility 

We obtained 1,204 genomic DNA samples from a cohort of 
children aged 6 months to 17 years in the village of Kenieroba, 
Mali. This cohort was followed through 4 complete transmis- 
sion seasons (2008—2011) to record the frequency and severity 
of all Plasmodium falciparum malaria episodes (Lopera-Mesa 
et al, in preparation). We quantified SMN1 copy number in 
the samples by qPCR using SYBR Green I dye (Life Technolo- 
gies). We performed a Poisson regression on malaria episodes 
using SMN1 copy number and other variables known or sus- 
pected to influence malaria incidence and parasite density. The 
cohort study was also approved by the Ethical Committee at 
FMOS and by the IRB at the National Institute of Allergy and 
Infectious Diseases. Parents or guardians of the children pro- 
vided written informed consent. 

SMN1 Copy Number and Gene Duplication at 
the SMA Locus 

We performed a real-time SYBR Green I dye— based qPCR assay 
using primers 15 to quantify the copy numbers of SMN1 exon 
7, SMN2 exon 7, and the £MV7-neighboring genes NAIP 
(neuronal apoptosis inhibitory protein) exon 5 and H4F5t exon 
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2 in 200 Malian DNA samples. Three Tunisian DNA samples 
with known copy number were used as controls. 15 We com- 
pared the copy numbers of SMN1 exon 7 and the other genes 
to assess duplication at the SMA locus, using the Pearson chi- 
square test and p < 0.05 to indicate a significant difference. 

SMN Gene Localization by Fluorescent In Situ 
Hybridization Analysis 

Metaphase preparations from Epstein— Barr virus— transformed 
lymphoblastoid cells were done by standard air-drying technique, 
and fluorescent in situ hybridization (FISH) was done with labeled 
DNA by nick-translation technique, essentially as described. 16 Fifty 
nanograms of labeled probe (Pi3996, a 28kb genomic SMN clone 
kindly provided by Arthur Burghes, Ohio State University, Colum- 
bus, OH) was applied to each slide. Blocking, hybridization, and 
counterstaining were done as previously described. 16 

SMN Expression Analysis by qPCR 

We obtained 24 lymphoblastoid cell lines with 1 to 4 copies of 
SMN1 from Nigerian and Kenyan donors (Coriell Institute). 
RNA and cDNA were prepared as described. 1 1 

We used Taqman qPCR to determine SMN1 copy num- 
bers in 39 DNA samples from unrelated individuals in the 
Centre d'Etude du Polymorphisme Humain (CEPH) collection 
(Paris, France). We then used SMN mRNA expression data for 
the corresponding CEPH lymphoblasts from GeneNetwork at 
the University of Tennessee (http://www.genenetwork.org/ 
webqtl / WebQTL.py?cmd= sch&refseq = NM_000344&species = 
human; record ID ILMN_1665022) and compared the mean 
SMN mRNA level to the SMN1 copy number. 

We determined SMN1 and SMN2 copy numbers and 
SMN mRNA expression levels in 1 5 human induced pluripotent 
stem (iPS) cell lines from non-SMA individuals (7 healthy con- 
trols, 6 patients with spinal and bulbar muscular atrophy, and 2 
patients with a-sarcoglycan mutations). The iPS cells were made 
by lentiviral transduction or mRNA transfection with the reprog- 
ramming factors Oct4, Klf4, Sox2, and c-Myc (Millipore, Biller- 
ica, MA, and Stemgent, Cambridge, MA, respectively) . 

Western Blot Analysis 

We extracted protein lysates from the fibroblast and lympho- 
blastoid cell lines and performed a semiquantitative Western 
blot using 40 fig of protein lysate and human monoclonal anti- 
SMN antibody as described previously. 17 

Statistical Methods 

Confidence intervals (CIs) on proportions were calculated by 
exact (Clopper— Pearson) method, and comparisons between 
proportions used the central Fisher exact test and the associated 
CIs. To test for a relationship to malaria, we performed 2 
models: a Poisson regression on the malaria episodes adjusted 
for linear overdispersion and a generalized estimating equation 
model on the logio parasite (P. falciparum) density values. As in 
Lopera-Mesa et al (in preparation), the models adjusted for age, 
hemoglobin type, ^-thalassemia, glucose-6-phosphate dehydro- 
genase deficiency, ABO blood group, village, sex, and ethnicity. 



SMN1 copy number was included as a continuous variable not 
rounded to the nearest integer. 

We used Spearman rank correlation with correction for 
ties to examine the association between SMN mRNA expres- 
sion and SMN1 copy number, and used Fisher exact test to 
examine the association between gender and SMN1 copy 
number. 

Results 

High SMN1 and Low SMN2 Copy Numbers in 
Sub-Saharan Africans 

We determined SMN1 copy number in individuals of 
black African ancestry (628 Malians of mixed ethnicity, 
120 Nigerians of Yoruba ethnicity, and 120 Kenyans of 
Luhya ethnicity). Malian adult participants were 70% 
male and had a mean ± standard deviation age of 22 ± 
2.2 years. The frequency of SMN1 copy numbers in 
these 3 populations and from other healthy populations 
reported in the literature 9 ' 1 } ~ 22 are listed (Table 1). In 
our study, samples with 4 or 5 copies of SMN1 could 
not be reliably distinguished from each another and were 
combined. The frequency of SMA carriers with just 1 
copy of SMN1 was 1 in 209 (3 in 628; 95% CI = 1 in 
74-1,014) in the Malian population. Overall, the SMN1 
copy number distribution was significantly different 
between the Malians and other sub-Saharan Africans 
compared to previously reported European and Asian 
populations if < 0.0001). The distribution of SMN1 
copy number for the Malians was also significantly differ- 
ent from the Nigerians {p = 0.006) and Kenyans (p = 
0.002), likely due to 53% of the Malians having >3 cop- 
ies of SMN1, compared to 38% for both the Nigerian 
and Kenyan samples. There was significant association 
between SMN1 copy number and gender in the Malians 
(p = 0.002), likely due to the males having a higher pro- 
portion (57%) of >3 copies of SMN1 compared to 41% 
for females. 

We also determined SMN2 copy number in 613 
Malian, 120 Nigerian, and 120 Kenyan samples, and 
compared the copy number distribution to those of 
healthy subjects reported in the literature (Table 2). The 
SMN2 copy number distribution in Malians was signifi- 
cantly different from Nigerians (p = 0.02) but not Ken- 
yans (p = 0.26). When we compare the results in sub- 
Saharan Africans from our study with previously reported 
results in Europeans (see Table 2), the percentage of Afri- 
cans with no SMN2 (19—27%) is significantly higher 
(odds ratio = 3.7; 95% CI = 2.8-4.8; p < 0.0001) 
than in individuals of European ancestry (8-9%). I9 ' 23 

Causes of SMN Copy Number Differences 

The differences in SMN1 and SMN2 copy numbers 
between sub-Saharan Africans and Eurasians could be 
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due to increased SMN2 to SMN1 gene conversion, natu- 
ral selection of high SMN1 copy number (e.g., for pro- 
tection against 7? falciparum malaria), or SMN1 
duplication at the SMA locus or elsewhere in the African 
genome. 

To investigate a role for gene conversion, we used 
standard genotyping to identify 20 Malian and 12 NIH 
samples with >3 SMN1 copies. We found that 14% (49 of 
343) and 9% (15 of 173) of SAW clones were SMN1/2 
hybrids by restriction digestion in the Malian and NIH 
samples, respectively (Table 3). Identification of hybrid 
clones by sequencing gave similar results (see Table 3) . The 
restriction digestion results were also similar in 1 1 Nigerian 
samples, particularly those with >3 SMN1 copies, where 
4% of clones (7 of 165) were SMN hybrids. The frequency 
of SMN hybrid clones increased with SMN1 copy number; 
we found no SMN hybrids in 10 Malian and 10 NIH sam- 
ples with a 2SMN1/0SMN2 genotype. Thus, SMN hybrid 
genes may account for part but not all of the differences in 
SMN1 and SMN2 copy numbers we observed. 

To explore whether high SMN1 copy numbers may 
have been naturally selected for resistance to P. falciparum 
malaria or high parasite densities, we included our 
SMN1 copy data in an analysis of genetic resistance to 
these outcomes (Lopera-Mesa et al, in preparation). We 
saw no difference in average number of malaria episodes 
by type of episode or whether SMN1 copy number is 
>3 (Table 4). In an analysis that accounts for age, sickle- 
cell trait, and other malaria-protective host factors, we 
found no association between SMN1 copy number and 
malaria incidence. Specifically, the relative risk (RR) of a 
malaria episode (severe or mild) for each additional 
SMN1 copy is not significantly different from 1 (RR = 
0.97, 95% CI = 0.92-1.03,/- = 0.3). For comparison, 
these data are sufficient to show clearly the protective 
effect of sickle-cell trait (RR = 0.67, 95% CI = 0.58- 
0.76, p < 0.0001). Also, the change in average logio par- 
asitemia for each additional SMN1 copy number is not 
significantly different from 0 ( — 0.004; 95% CI = 
-0.046 to 0.039, p = 0.9). 

Regarding the possibility of locus duplication, we 
did not find increased copy numbers of other genes at 
the SMA locus (NAIPt and H4F5i), excluding the possi- 
bility of segmental duplication (data available on 
request). FISH analysis showed that all SMN1 copies are 
localized to the SMA locus at chromosome 5ql3.1 
(Fig 1). 

Effects of SMN Copy Number Differences 

We did not find significant increases in SMN mRNA 
level with increasing SMN1 copy number in 24 lympho- 
blast samples of Nigerian and Kenyan origin (p = 0.14; 
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TABLE 2. Frequency of SMN2 Copy Number in Different Geographical Regions 







Sub-Saharan Africa 






Europe 




SMN2 


Mali, 


Nigeria, 


Kenya, 


Germany, 


France, 


Sweden, 


Copy# 


n = 613 


n = 120 


n = 120 


n = 100 


n = 621 


n = 502 


0 


150 (24%) 


33 (27%) 


23 (19%) 


9 (9%) 


52 (8%) 


37 (8%) 


1 


276 (45%) 


66 (55%) 


53 (44%) 


38 (38%) 


239 (38%) 


212 (42%) 


2 


172 (28%) 


21 (18%) 


38 (32%) 


48 (48%) 


321 (52%) 


247 (49% 


3 


15 (3%) a 




6 (5%) 


5 (5%) 


9 (2%) 


6 (1%) 


Study 




Present study 




Anhuf 2003 23 


Corcia 20 12 20 



Three or 4 SMN2 copies. 



Fig 2A), in 15 fibroblast samples of unknown ethnic ori- 
gin from the Coriell Institute (not shown), or in 39 lym- 
phoblast samples of unknown ethnic origin from CEPH 
(p = 0.88; see Fig 2B). SMN protein levels measured by 
Western blot analysis also did not vary with SMN1 copy 
number in lymphoblasts or fibroblasts (not shown). iPS 
cells showed a trend toward increasing SMN mRNA lev- 
els with increasing SMN1 copy number, but this was not 
significant (p = 0.32; see Fig 2C). 

Discussion 

Until recently SMA was described as a "panethnic" con- 
dition, implying similar carrier frequencies and disease 
incidence worldwide. However, most SMA carrier fre- 
quency data have been derived from populations in the 
United States, Europe, and Asia. Most population-based 
studies of SMA carrier frequency have provided few or 
no data on ethnicity, and few studies have been per- 
formed in African populations. Studies on SMA patients 
in North Africa 24-26 show similar results to those in the 
United States and Europe. However, inherited neurologi- 



cal diseases in general and SMA in particular have been 
understudied in sub-Saharan Africa, in part because of 
the lack of genetic diagnostics. Although SMA type 1 is 
reported to be rare in black Africans, 7 ' 27 it is not known 
whether genetic modifiers of SMA are present in this 
population. To address this possibility, we investigated 
SMN copy number variation in healthy Malian adults. 
We sampled medical students at the University of 
Bamako because they are representative of the general 
Malian population in ethnic background and geographic 
origin, and research facilities are available for rapid onsite 
processing of blood samples. The low SMA carrier fre- 
quency we found in this population is consistent with 
the apparent rarity of SMA in Mali. 10 

The cause of high SMN1 copy number in sub- 
Saharan Africans relative to Eurasians remains to be 
determined, although our findings exclude such possibil- 
ities as gene conversion, selective pressure due to malaria 
resistance, and locus duplication. SMN1 to SMN2 gene 
conversion has been reported in SMA patients with late 
onset disease, milder phenotype, or both. 28-30 



TABLE 3. Frequency of SMN1, SMN2, and SMN Hybrid Genes Identified by Restriction Endonuclease Diges- 
tion and Sequence Analysis 


Origin of Samples 


SMN1 


Number of Clones Identified by Restriction Digestion 
SMN2 SMN Hybrid 


Total 






Mali, n = 20 


264 


40 49 (14%) 


343 






NIH, n = 12 


136 


22 15 (9%) 


173 






Number of Clones Identified by Sequencing 








SMN1 


SMN2 SMN1>2 Hybrid SMN2>1 Hybrid 


Total 






Mali, n = 20 


195 


2 3 46 (19%) 


246 






NIH, n = 12 


111 


3 4 9 (7%) 


127 





April 2014 



529 



ANNALS of Neurology 



TABLE 4. Incidence of Plasmodium falciparum 
Malaria Episodes, Stratified by SMN1 Copy Number 



Average No. of Malaria 
Categorization Episodes 



SMN1 Copy* 


No. 


Mild and 


Mild 


Severe 






Severe 






<2 (low) 


999 


2.85 


2.63 


0.21 


>3 (high) 


205 


2.89 


2.63 


0.26 



Conversely, SMN2 to SMN1 conversion is suggested by 
the finding that individuals with higher SMN1 copies 
tend to have lower SMN2 copies. 31 Although we were 
able to identify previously reported 13 ' 32 SMN1/2 hybrids 
in Malians and Nigerians, their frequency is too low to 
explain the high SMN1 and low SMN2 copy numbers in 
these populations. The relative prevalence of high SMN1 
(and low SMN2) copy number in sub-Saharan Africa, or 
low SMN1 (and high SMN2) copy number elsewhere, 
suggests that diseases with significant morbidity or mor- 
tality in these regions select for SMN variation. Although 
we found no overall association with malaria susceptibil- 
ity, other infectious diseases or environmental hazards 
may be selecting for SMN copy number variants. The 
finding that SMN1 copy number can vary from 0 to 4 
per chromosome in whites 14 suggested to us that locus 
duplication may have increased SMN1 copy number to 
>4 in Malians. Because pseudogenes may arise from ret- 
rotranscription 33 and be located at genomic loci distinct 




FIGURE 1 : Fluorescence in situ hybridization (FISH) analysis 
at the SMN locus. FISH analysis in lymphoblastoid cell line 
#19441 with a 4 SMN1/0 SMN2 genotype shows signal 
amplification of both copies on chromosome 5 and not 
elsewhere. 




SMN Copy Number 




SMN Copy Number 



C S iPSC 




SMN Copy Number 



FIGURE 2: Total SMN mRNA levels in different cells. (A) Cor- 
iell lymphoblasts (n = 3 with SMN1 copy number 1, 7 with 
copy number 2, 11 with copy number 3, and 3 with copy 
number 4). (B) Centre d'Etude du Polymorphisme Humain 
(CEPH) lymphoblasts (n = 33 with SMN1 copy number 2, 3 
with copy number 3, and 3 with copy number 4). (C) 
Induced pluripotent stem cells (iPSC; n = 8 with SMN1 copy 
number 2, 4 with copy number 3, and 3 with copy number 
4). Mean and standard error of the mean are shown. 

from their origin, we used FISH to analyze this possibil- 
ity in our Malian samples. FISH analysis showed SMN 
signal only at 5ql3.1, not at 6p21.3 (a paralogous locus 
containing NAJP exon 9, 34 which has a high degree of 
nucleotide similarity to SMN) or elsewhere. 

Given that gene conversion, natural selection by 
malaria, and locus duplication did not explain the rela- 
tively high SMN1 copy numbers in Malians, we considered 
potential roles for the high degree of consanguinity and 
the "bottleneck phenomenon." Data on genetic disorders 
in Arab populations extracted from the Catalogue of 
Transmission Genetics in Arabs database (http://www. 
cags.org.ae) indicate a relative abundance of recessive disor- 
ders associated with the practice of consanguinity, with a 
rate of first cousin marriage of 25 to 30%. 35 



530 



Volume 75, No. 4 



Sangare et al.: Low SMA Carrier Frequency in Africa 



Consanguinity was reported by 27% of our Malian partici- 
pants, and parental first cousin marriage was reported by 
17%. If consanguinity is causal, then we would expect sim- 
ilar SMA carrier frequencies and SMN copy number distri- 
butions in other populations with high consanguinity, but 
this has not been reported. For example, studies in Saudi 
Arabia and Egypt found an SMA carrier frequency of 1 in 
20. 36-38 Finally, high SMN1 copy numbers in sub-Saharan 
Africans may be due to the bottleneck phenomenon, 
meaning that by chance the population that migrated out 
of Africa to Asia and Europe had lower SMN1 copy num- 
ber or randomly drifted in this direction after the outmi- 
gration. 39 This may be the most likely explanation for the 
different SMN1 distributions in these areas. Regardless of 
the cause, our assessment of SMN copy number in differ- 
ent ethnic groups helps to appropriately target SMA carrier 
testing and provide accurate risk assessment to individuals 
from different populations. 

A fundamental question is how SMN1 copy num- 
ber variation affects gene expression and phenotypic 
traits. It has been reported that African populations have 
a higher frequency of NAIP duplication than Eurasians, 
with a relative increase in transcription of this gene. 
Although we detected no such correlation between 
SMN1 copy number and expression levels in lympho- 
blasts and fibroblasts, there was a trend toward correla- 
tion in iPS cells. The lack of correlation in lymphoblasts 
and fibroblasts may indicate feedback regulation to main- 
tain constant SMN mRNA and protein levels in these 
cells, or reflect a limit on gene transcription due to epi- 
genetic factors. Further study of this phenomenon in dif- 
ferent cell types under different conditions could help to 
identify factors responsible for regulating SMN1 expres- 
sion, which could be targets for therapeutics 
development. 
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